EQUIDISTRIBUTION OF DENSE SUBGROUPS ON NILPOTENT LIE GROUPS



EMMANUEL BREUILLARD 

Abstract. Let $\Gamma$ be a dense subgroup of a simply connected nilpotent Lie group $G$
generated by a finite symmetric set $S$. We consider the $n$-ball $S^n$ for the word metric
induced by $S$ on $\Gamma$. We show that $S^n$ (with uniform measure) becomes equidistributed on
$G$ with respect to the Haar measure as $n$ tends to infinity. We also prove the analogous
result for random walk averages.



1. Introduction 

In this paper we are concerned with equidistribution properties of dense subgroups of
Lie groups. Let $G$ be a Lie group and $S = \{1, a_1, \dots, a_s, a_1^{-1}, \dots, a_s^{-1}\}$ a finite
symmetric set of elements in $G$, which we assume to generate a dense subgroup $\Gamma$ in $G$.
Let $S^n$ be the $n$-ball in the Cayley graph of $\Gamma$ induced by $S$, i.e. the set of elements of
$\Gamma$ that can be written as a product of at most $n$ elements from $S$. We give ourselves two
open subsets $U$ and $V$ in $G$ with Lebesgue-negligible boundary. Consider the ratios

    $R_n(U,V) = \frac{|S^n \cap U|}{|S^n \cap V|}$

where $|A|$ denotes the cardinality of a set $A$. The question is to find out whether or not
$R_n(U,V)$ converges and, if it does, to determine the measure $m$ on $G$ such that

(1)    $\lim_{n \to +\infty} R_n(U,V) = \frac{m(U)}{m(V)}.$

Various kinds of averages are possible instead of the uniform averages over $S^n$ above. In
particular one may consider random walk averages or averages over the abstract free group
generated by $S$. Limits such as (1) and results of this kind are usually called ratio limit
theorems (see [19], [16]).

Arnol'd and Krylov were among the first to consider such a question and they showed
in [2] how to use the representation theory of $G$ to quickly give a positive answer to this
problem when $G$ is a compact Lie group and $S$ is the set of free generators of a dense free
subgroup of rank $s$ in $G$. Then (1) holds with $m$ the normalized Haar measure on $G$.

In this paper we prove that (1) holds with $m$ a Haar measure when $G$ is any closed
subgroup of a simply connected nilpotent Lie group (Corollary 1.2 below). The case of
semisimple Lie groups (even $SL_2(\mathbb{R})$) remains an open question up to now. In [17], Kazhdan
considered the case when $G = \mathrm{Isom}(\mathbb{R}^d)^\circ$ for $d = 2$ and obtained an analogous ratio limit
theorem for random walk averages. His argument was corrected and the result extended
by Guivarc'h in [16] (see also [7] for the precise asymptotics) but the case when $d > 2$ is a
well-known open problem.

Let us fix some notation and then state the main result of this paper. For a group $G$, we
let $\mathrm{vol}_G$ be a left Haar measure on $G$. Recall that according to a theorem of Guivarc'h (see
[15] and [8]), if $G$ is nilpotent, locally compact and compactly generated, then there is an
integer $d(G) \in \mathbb{N}$ such that $\mathrm{vol}_G(\Omega^n) \asymp n^{d(G)}$ for any compact generating set $\Omega$. When $G$ is
connected and simply connected, the integer $d(G)$ is given by the Bass-Guivarc'h formula
(9) below. We show:

Theorem 1.1. (Dense subgroups are equidistributed) Let $\Gamma$ be a finitely generated nilpotent
group and $S = \{1, a_1, \dots, a_s, a_1^{-1}, \dots, a_s^{-1}\}$ a finite generating set. Let $G$ be a closed
subgroup of a simply connected nilpotent Lie group. Suppose $\phi : \Gamma \to G$ is a homomorphism
with dense image. Then there is a positive constant $C > 0$ such that

(2)    $\lim_{n \to +\infty} \frac{|S^n \cap \phi^{-1}(B)|}{n^{d(\Gamma) - d(G)}} = C \cdot \mathrm{vol}_G(B)$

for every bounded Borel subset $B \subset G$ with negligible boundary.

Corollary 1.2. If $G$ is a closed subgroup of a simply connected nilpotent Lie group, then (1)
holds for $m$ a Haar measure on $G$.

This result can be seen as a generalization of the classical Weyl equidistribution ([23])
of multiples of an irrational number $\alpha$ modulo 1. Indeed, let $G = \mathbb{R}$, $\Gamma = \mathbb{Z}^2$, and
$S = \{(0,0), (0,\pm 1), (\pm 1,0)\}$, $\phi(x,y) = x + \alpha y$ and $B = (a,b) \subset [0,1]$. Then (2) in this case
translates as

    $\lim_{n \to +\infty} \frac{1}{2n} \left|\{k \in [-n,n],\ k\alpha \in (a,b) \bmod 1\}\right| = b - a.$

For abelian $G$ the theorem can be deduced from Weyl's equidistribution. However, as
we will see below, when $G$ is nilpotent, it requires different techniques.
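The abelian special case above is easy to probe numerically. The following is a minimal sketch, not part of the original argument: the function name `weyl_fraction`, the choice $\alpha = \sqrt{2}$ and the interval $(0.2, 0.5)$ are all illustrative assumptions. It counts the integers $k \in [-n, n]$ with $k\alpha \bmod 1$ in $(a,b)$ and normalizes by $2n$.

```python
import math

def weyl_fraction(alpha: float, a: float, b: float, n: int) -> float:
    """Return (1/2n) * #{k in [-n, n] : (k * alpha mod 1) lies in (a, b)}."""
    # Python's % with a positive modulus returns a value in [0, 1) even for k < 0.
    hits = sum(1 for k in range(-n, n + 1) if a < (k * alpha) % 1.0 < b)
    return hits / (2 * n)

alpha = math.sqrt(2)   # illustrative irrational number (an assumption of this sketch)
a, b = 0.2, 0.5        # illustrative interval (a, b) inside [0, 1]
for n in (10, 100, 1000, 10000):
    # The printed fractions should approach b - a as n grows.
    print(n, weyl_fraction(alpha, a, b, n))
```

Floating-point arithmetic limits how large $n$ can usefully be taken, so this only illustrates the statement and proves nothing.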

Combining Alexopoulos' theorem [1] on the asymptotics of the return probability of
random walks on finitely generated nilpotent groups with Theorem 1.1 allows us to prove the
analogous limit theorem for random walk averages, namely:

Corollary 1.3. (Local Limit Theorem) Let $G$ be a simply connected nilpotent Lie group
and $\mu$ a symmetric and finitely supported probability measure on $G$ whose support
generates a dense subgroup. Then there is $c(\mu) > 0$ such that for any Borel set $B$ with
Lebesgue-negligible boundary,

(3)    $\lim_{n \to +\infty} n^{d(G)/2} \mu^{*n}(B) = c(\mu) \cdot \mathrm{vol}_G(B)$

where $\mu^{*n}$ denotes the $n$-fold convolution power of $\mu$.

The proof of Theorem 1.1 makes use of two crucial ingredients. First, we need precise
information on the shape of the $n$-balls $S^n$ in $\Gamma$, and this is essentially provided by Pansu's
theorem from [20]. Second, we use a now well-known principle from ergodic theory (see [10],
[18]) according to which the ergodic properties of the action of $\Gamma$ on a homogeneous space
$N/M$ are dual to those of the action of $M$ on $N/\Gamma$. The unique ergodicity of unipotent
flows on nilmanifolds, which is an "ancestor" of Ratner's theorem, allows then to reduce
the equidistribution statement to a computation of the asymptotic volume of cosets of $M$
inside large balls in $N$. The goal of Section 2 is to prove this volume estimate (Proposition
2.14) and give some background on homogeneous quasi-norms on nilpotent Lie groups. In
Section 3, we complete the proof of Theorem 1.1.

N.B.: a. We prove (see Theorems 3.1 and 3.5) that the limit (2) exists also for more
general averages of the form $B(n) \cap \phi^{-1}(U)$ in place of $S^n \cap \phi^{-1}(U)$, where $B(n)$ is the
$n$-ball for a quasi-norm on $\Gamma$, or any left-invariant coarsely geodesic distance.

b. The techniques of this paper allow one to get uniformity of convergence in (2) (resp. (3)),
when $B$ is allowed to vary among translates $xB$ such that the distance between $x$ and 1 is
$o(n)$ (resp. $o(\sqrt{n})$), and a uniform upper bound for the left hand side of (2) holds for all
translates. But we will not need these refinements here.

c. Our equidistribution problem of a dense subgroup $\Gamma$ in a nilpotent Lie group $G$ can be
phrased more generally as the question of whether $\Gamma$-orbits equidistribute in their closure
on a homogeneous space $N/M$. Here we treated the case of $M$ normal in $N$ and a dense
orbit. The general case, when $N$ is nilpotent, is slightly more involved but can be treated
by similar methods.

d. In [9] we showed, in the case when $G$ is the Heisenberg group, that (3) holds for
more general measures, namely any centered and compactly supported measure $\mu$ on $G$.
The methods of [9] use representation theory and are of a very different nature from the
proof displayed in the present paper.

2. Homogeneous quasi-norms and volume of balls on nilpotent Lie groups

The main goal of this section is to prove the following result for a simply connected
nilpotent Lie group $N$.

Proposition 2.1. For any quasi-norm on $N$, the balls $(D_t)_{t>0}$, $D_t = \{|x| \leq t\}$, form a
nicely growing family of subsets of $N$.

Quasi-norms on $N$ and nicely growing subsets are defined below. This statement
essentially means that given a left invariant metric on $N$, we can compute the asymptotics of
the volume of balls for the induced metric on a closed connected subgroup. To prove it, we
will need to introduce some background on nilpotent Lie groups. This is also the second
goal of this section. In the next two paragraphs we deal exclusively with filtered vector
spaces, while in the remaining four we apply the results to nilpotent Lie groups.

2.1. Filtrations on vector spaces and exterior powers. On a nilpotent Lie algebra,
there is a canonical filtration given by the descending central series. In this paragraph, we
describe some properties of filtrations and associated degree functions in the more general
context of vector spaces. This allows us to attach a degree $\deg_V(W)$ to every vector subspace
$W$, a notion that will be useful when proving Proposition 2.14. The content of this paragraph
is probably well known but we couldn't find an adequate reference for it.

2.1.1. Filtrations and degree on $V$ and $\Lambda^*V$. Let $V$ be a real vector space equipped with a
filtration, i.e. a non-increasing finite sequence of vector subspaces
$V = V_1 \supseteq V_2 \supseteq \dots \supseteq V_r \supseteq V_{r+1} = \{0\}$.

Proposition 2.2. Associated to this filtration is a function $\deg : V \to \mathbb{N}$ called the degree
defined for $v \in V$ by $\deg(v) = \max\{i \geq 1,\ v \in V_i\}$. This degree function extends in a canonical
way to the exterior power $\Lambda^*V$.

To see this, one can for instance consider a basis $(e_1, \dots, e_n)$ of $V$ which is adapted to the
filtration $(V_i)_i$ in the sense that $V_i = \mathrm{span}\{e_k \mid k = 1, \dots, n,\ \deg(e_k) \geq i\}$. This basis gives
rise to an associated basis for $\Lambda^*V$ given by the $e_I = e_{i_1} \wedge \dots \wedge e_{i_k}$'s where $I = \{i_1, \dots, i_k\}$
ranges over all subsets of $\{1, \dots, n\}$. We can then define the degree of a basis element by

(4)    $\deg(e_I) = \deg(e_{i_1}) + \dots + \deg(e_{i_k}).$

Subsequently, this defines a filtration on $\Lambda^*V$ by letting
$\mathcal{V}_i = \mathrm{span}\{e_I \mid I \subset \{1, \dots, n\},\ \deg(e_I) \geq i\}$. In turn, we get a degree function on $\Lambda^*V$
extending the definition (4) by setting $\deg(\xi) = \max\{i \geq 1,\ \xi \in \mathcal{V}_i\}$. It is just a matter of
simple verification to check that the filtration $(\mathcal{V}_i)_i$ and the degree thus defined on $\Lambda^*V$
are independent of the choice of an adapted basis.

2.1.2. Induced filtration on a subspace, degree of a subspace. Let $W$ be a vector subspace
of $V$.

Proposition 2.3. The operations of restricting to a subspace $W$ and extending to the
exterior power commute and give rise, after composition, to a uniquely defined degree function
$\deg : \Lambda^*W \to \mathbb{N}$.

Proof. Given an adapted basis of $W$ with respect to the filtration $(W \cap V_i)_i$, it is possible
to complete this basis into a basis $(e_1, \dots, e_n)$ of $V$ which is adapted to the original filtration
$(V_i)_i$. So we easily check that

    $\mathcal{W}_i = \mathrm{span}\{e_I \mid I \subset J_W,\ \deg(e_I) \geq i\} = \Lambda^*W \cap \mathcal{V}_i$

where $J_W$ is the set of indices such that $W = \mathrm{span}\{e_i,\ i \in J_W\}$, and where $(\mathcal{W}_i)_i$ and $(\mathcal{V}_i)_i$
denote the filtrations induced on $\Lambda^*W$ and $\Lambda^*V$ respectively. It follows that there is a
unique notion of degree on $\Lambda^*W$ associated to the original degree on $V$. □

In particular, this allows us to define the degree of a subspace $W$ by setting $\deg_V(W) =
\deg(f_1 \wedge \dots \wedge f_k)$, where $(f_1, \dots, f_k)$ is any basis of $W$ (observe that $\deg$ is really defined on
the projective space $P(\Lambda^*V)$). If $(e_1, \dots, e_k)$ is an adapted basis for the filtration $(W \cap V_i)_i$,
then (4) yields

(5)    $\deg_V(W) = \deg(e_1 \wedge \dots \wedge e_k) = \sum_{i=1}^{k} \deg(e_i) = \sum_{j \geq 1} \dim(W \cap V_j).$

Finally let $(\delta_t)_{t>0}$ be the one-parameter group of endomorphisms of $V$ defined by
$\delta_t(e_i) = t^{\deg(e_i)} e_i$ for some adapted basis $(e_1, \dots, e_n)$. We easily check the following:

Proposition 2.4. The $\delta_t$'s extend canonically to $\Lambda^*V$ and for any non-zero $\xi \in \Lambda^*V$,
$t^{-d}\delta_t(\xi)$ tends to a non-zero limit in $\Lambda^*V$ as $t \to 0$ if and only if $d = \deg(\xi)$.

2.2. Homogeneous quasi-norms on filtered vector spaces. Let $V$ be a real vector
space with a filtration $V = V_1 \supseteq V_2 \supseteq \dots \supseteq V_r \supseteq V_{r+1} = \{0\}$. We say that $(\delta_t)_{t>0}$
is a one-parameter group of dilations associated to this filtration if the $\delta_t$'s are linear
automorphisms of $V$ such that, for every $i = 1, \dots, r$, the eigenspace $m_i = \{x \in V,\ \delta_t(x) =
t^i x\}$ is independent of $t$ ($t \neq 1$) and is a (possibly trivial) supplementary vector subspace
of $V_{i+1}$ inside $V_i$, i.e.

(6)    $V_i = m_i \oplus V_{i+1}.$

The $r$ subspaces $m_i$ are determined by $(\delta_t)_{t>0}$ and vice-versa any choice of $r$ subspaces $m_i$
verifying (6) determines a unique group of dilations. If $V = \bigoplus_{1 \leq i \leq r} m_i$ and $V = \bigoplus_{1 \leq i \leq r} m'_i$
are two choices of supplementary subspaces, then the associated one-parameter groups of
dilations satisfy the relation $\delta'_t = \phi \circ \delta_t \circ \phi^{-1}$ where $\phi$ is the coordinate change from the
first direct sum to the second. Also we have uniformly on bounded subsets of $V$:

(7)    $\lim_{t \to +\infty} \delta_{1/t} \circ \phi \circ \delta_t = \mathrm{id}.$

We now introduce the following definition.

Definition 2.5. A continuous function $|\cdot| : V \to \mathbb{R}_+$ is called a homogeneous
quasi-norm associated to the dilations $(\delta_t)_t$, or simply a quasi-norm, if it satisfies the following
properties:

(i) $|x| = 0 \Leftrightarrow x = 0$.

(ii) $|\delta_t(x)| = t|x|$ for all $t > 0$.

Examples of quasi-norms are given by supremum quasi-norms of the type $|x| = \max_p \|\pi_p(x)\|_p^{1/p}$
where $\|\cdot\|_p$ are ordinary norms on the vector space $m_p$ and $\pi_p$ is the projection on $m_p$
according to the decomposition $V = \bigoplus_{1 \leq i \leq r} m_i$. More examples will be given below in the
context of nilpotent Lie groups.
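As a concrete instance of the supremum quasi-norm, consider a three-dimensional space graded as $m_1 = \mathrm{span}(e_1, e_2)$, $m_2 = \mathrm{span}(e_3)$ (the grading of the Heisenberg Lie algebra). The sketch below is an illustration only: the function names `dilate` and `quasi_norm`, the sup-norm on $m_1$ and the absolute value on $m_2$ are assumptions made for this example. It checks the homogeneity property (ii) numerically.

```python
def dilate(t, x):
    """delta_t: multiply the m_1-coordinates by t and the m_2-coordinate by t^2."""
    x1, x2, x3 = x
    return (t * x1, t * x2, t * t * x3)

def quasi_norm(x):
    """|x| = max(||pi_1(x)||, ||pi_2(x)||^(1/2)), sup-norm on m_1, abs value on m_2."""
    x1, x2, x3 = x
    return max(abs(x1), abs(x2), abs(x3) ** 0.5)

x = (1.0, -0.5, 9.0)   # illustrative point
for t in (0.5, 2.0, 10.0):
    # Homogeneity: the two printed values should agree.
    print(t, quasi_norm(dilate(t, x)), t * quasi_norm(x))
```

Note that this function is not a norm in the usual sense: it scales linearly under the dilations $\delta_t$, not under scalar multiplication.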

Clearly, two quasi-norms associated to the same one-parameter group of dilations are
equivalent in the sense that $\frac{1}{c}|\cdot|_1 \leq |\cdot|_2 \leq c|\cdot|_1$ for some constant $c > 0$. Furthermore,
using (7), we check the following:

Proposition 2.6. If $|\cdot|'$ is a quasi-norm that is homogeneous with respect to the
one-parameter group $(\delta'_t)_t$ associated to the decomposition $V = \bigoplus_{1 \leq i \leq r} m'_i$ then there is a unique
quasi-norm $|\cdot|$ that is homogeneous with respect to $(\delta_t)_t$ such that

    $|x| - |x|' = o(|x|)$

as $|x|$ is large. In fact $|x| = |\phi(x)|'$.

As will be observed below in Section 2.6.2, to every reasonable left-invariant distance on
a simply connected nilpotent Lie group $N$ is associated a unique quasi-norm that is
asymptotic to it. In particular, every large ball for a left-invariant distance is well approximated
by some quasi-norm ball. This fact makes the volume computations needed in our main
theorem possible because, thanks to their scaling property, such computations are easy in
the case of quasi-norm balls.

2.2.1. Invariance under restriction to a subspace or projection to a quotient. Let $|\cdot|$ be a
homogeneous quasi-norm associated to some fixed one-parameter group of dilations $(\delta_t)_{t>0}$
and let $D_t := \{x \in V,\ |x| \leq t\}$ be the corresponding quasi-norm ball. Let $W$ be a vector
subspace of $V$ endowed with the induced filtration $(W \cap V_i)_i$ and let $(\delta^0_t)_t$ be some
one-parameter group of dilations on $W$ with respect to that filtration. Although the restriction
of $|\cdot|$ to $W$ may not be a quasi-norm on $W$, the following holds:

Proposition 2.7. There exists a unique homogeneous quasi-norm $|\cdot|_0$ on $W$ such that
$|x| - |x|_0 = o(|x|)$ if $x \in W$ and $|x|$ is large. In particular, if $D^0_t$ is the homogeneous
quasi-norm ball for $|\cdot|_0$ on $W$ then there exists $\varepsilon_t > 0$, $\varepsilon_t \to 0$ as $t \to +\infty$, such that

    $D^0_{t(1-\varepsilon_t)} \subset D_t \cap W \subset D^0_{t(1+\varepsilon_t)}.$

Similarly, we can consider the quotient vector space $V/W$ endowed with the induced
filtration $(V_i/V_i \cap W)_i$ and with some choice of a one-parameter group of dilations $(\delta^1_t)_t$.
Let $\pi : V \to V/W$ be the canonical projection and let $|y|_\pi := \inf\{|x|,\ \pi(x) = y\}$. Although
$|\cdot|_\pi$ may not be a quasi-norm on $V/W$, the following holds:

Proposition 2.8. There exists a unique homogeneous quasi-norm $|\cdot|_1$ on $V/W$ such that
$|y|_\pi - |y|_1 = o(|y|_\pi)$ if $y \in V/W$ and $|y|_\pi$ is large. In particular, if $D^1_t$ is the homogeneous
quasi-norm ball for $|\cdot|_1$ on $V/W$, then there exists $\varepsilon_t > 0$, $\varepsilon_t \to 0$ as $t \to +\infty$, such that

    $D^1_{t(1-\varepsilon_t)} \subset \pi(D_t) \subset D^1_{t(1+\varepsilon_t)}.$

We leave the proof of these propositions as an exercise.

2.3. Volume growth. Let $N$ be a simply connected nilpotent Lie group and $(C^p(N))_{p=1,\dots,r}$
its descending central series. The integer $r$ is the nilpotency length, that is the largest $r$
for which $C^r(N)$ is non trivial. We identify $N$ with its Lie algebra $\mathfrak{n}$ via the exponential
map, which is a diffeomorphism. The Lie product is a polynomial function on $\mathfrak{n} \times \mathfrak{n}$ and
this makes $N$ into a real algebraic group with a Zariski topology. By Theorem 2.1 of [21],
a closed subgroup of $N$ is Zariski-dense if and only if it is co-compact. Furthermore, any
closed subgroup $H$ of $N$ is contained in a unique minimal connected closed subgroup $\widetilde{H}$ of
$N$. The subgroup $\widetilde{H}$ is Zariski closed, simply connected, and $\widetilde{H}/H$ is compact.

It was proved by Guivarc'h in [15] (and independently by Bass [5] in the special case
of finitely generated nilpotent groups) that if $G$ is a closed co-compact subgroup of $N$
and $U$ is a compact generating neighborhood of the identity in $G$, then there are positive
constants $C_1$ and $C_2$ such that for any positive integer $n$

(8)    $C_1 \cdot n^{d(N)} \leq \mathrm{vol}_G(U^n) \leq C_2 \cdot n^{d(N)}$

where $d(N)$ is an integer called the homogeneous dimension of $N$ and is given by the
Bass-Guivarc'h formula:

(9)    $d(N) = \sum_{p \geq 1} \dim(C^p(N)) = \sum_{p \geq 1} p \cdot \dim(C^p(N)/C^{p+1}(N)).$

For general $H$ as above we set $d(H) = d(\widetilde{H})$. For instance, if $\Gamma$ is any nilpotent group
generated by a finite symmetric set $S$ then we can view it, modulo its finite torsion group,
as a lattice in a simply connected nilpotent Lie group $N$ according to a theorem of Malcev
(see [21], chp. 2). Hence $d(\Gamma) = \sum_{p \geq 1} p \cdot \mathrm{rk}(C^p(\Gamma)/C^{p+1}(\Gamma))$ and by (8) the ball $S^n$ of radius
$n$ in the word metric defined by $S$ has, up to multiplicative constants, $n^{d(\Gamma)}$ elements.
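To make the growth estimate concrete, here is a small sketch for the integer Heisenberg group, which is not treated explicitly in the text and is chosen here only as an illustration. The coordinates `(a, b, c)`, the generating set `S` and the function names are assumptions of this example. The code evaluates the Bass-Guivarc'h formula from the ranks of the successive quotients of the central series and enumerates word balls by breadth-first search, printing $|S^n| / n^{d(\Gamma)}$.

```python
def homogeneous_dimension(graded_ranks):
    """Bass-Guivarc'h formula: sum over p >= 1 of p * rk(C^p / C^{p+1})."""
    return sum(p * r for p, r in enumerate(graded_ranks, start=1))

def multiply(g, h):
    """Product in the integer Heisenberg group, with elements written as (a, b, c)."""
    return (g[0] + h[0], g[1] + h[1], g[2] + h[2] + g[0] * h[1])

# Symmetric generating set S = {1, x, x^-1, y, y^-1} (illustrative choice).
S = [(0, 0, 0), (1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0)]

def ball_sizes(radius):
    """Return [|S^0|, |S^1|, ..., |S^radius|] by breadth-first enumeration."""
    ball = {(0, 0, 0)}
    frontier = set(ball)
    sizes = [1]
    for _ in range(radius):
        # New elements are products of the previous sphere with a generator.
        new = {multiply(g, s) for g in frontier for s in S} - ball
        ball |= new
        frontier = new
        sizes.append(len(ball))
    return sizes

d = homogeneous_dimension([2, 1])   # ranks of C^1/C^2 and C^2/C^3 for this group
sizes = ball_sizes(12)
for n in range(1, 13):
    # By (8), the ratio |S^n| / n^d stays between two positive constants.
    print(n, sizes[n], sizes[n] / n ** d)
```

For this group the formula gives $d(\Gamma) = 1 \cdot 2 + 2 \cdot 1 = 4$, so the ball sizes are compared with $n^4$.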

The estimate (8) was later refined by Pansu who showed in [20] that $|S^n|/n^{d(\Gamma)}$ has a
non-zero limit when $n \to +\infty$. In fact, Pansu's argument can be adapted (see [8] for
details) to extend his result to all closed subgroups of $N$, namely:

Theorem 2.9. Let $G$ be a closed subgroup of $N$ and $U$ be a compact generating neighborhood
of the identity in $G$. Then there is a positive constant $\mathrm{AsVol}(U) > 0$ such that
$\lim_{n \to +\infty} \mathrm{vol}_G(U^n)/n^{d(G)} = \mathrm{AsVol}(U)$.

Note that the homogeneous dimension $d(N)$ coincides with the degree $\deg_{\mathfrak{n}}(\mathfrak{n})$ defined in (5)
where the filtration on $\mathfrak{n} = \mathrm{Lie}(N)$ is given by the descending central series. If $M$ is a
normal closed and connected subgroup of $N$, then we check that $\deg_{\mathfrak{n}}(\mathfrak{m}) = d(N) - d(N/M)$.

2.4. Polynomials, dilations and quasi-norms. We recall here a few well-known facts
about the analysis on nilpotent Lie groups (see [13]).

2.4.1. Degree of an element and a polynomial. Let $N$ be a simply connected nilpotent Lie
group, which we identify with its Lie algebra $\mathfrak{n}$ via the exponential map $\exp : \mathfrak{n} \to N$. We
say that a map $P : N \to \mathbb{R}$ is polynomial if $x \mapsto P(\exp(x))$ is a polynomial map on the
real vector space $\mathfrak{n}$. The descending central series $(C^k(\mathfrak{n}))_{k \geq 1}$ gives a canonical filtration
on $\mathfrak{n}$ and hence induces a degree function on $\mathfrak{n}$ (as in Prop. 2.2).

Let $(e_i)_{i=1,\dots,n}$ be an adapted basis of $\mathfrak{n}$, namely we assume that $C^i(\mathfrak{n}) = \mathrm{span}\{e_k \mid \deg(e_k) \geq i\}$
for all $i \geq 1$. We define the degree of a monomial $x^\alpha = x_1^{\alpha_1} \cdots x_n^{\alpha_n}$ to be
$d(\alpha) := \alpha_1 \deg(e_1) + \dots + \alpha_n \deg(e_n)$ for any multi-index $\alpha = (\alpha_1, \dots, \alpha_n)$, and the degree of an
arbitrary polynomial map to be the maximum degree of its monomials. This definition
is easily seen to be independent of the choice of the adapted basis used to define it.

The coordinates of the product of two elements in the basis $(e_i)_{i=1,\dots,n}$ are obtained from
the Campbell-Hausdorff formula as follows (see [13] p. 14):

(10)    $(xy)_i = x_i + y_i + P_i(x,y)$

where $P_i$ is a polynomial map on $\mathfrak{n} \times \mathfrak{n}$ of the following special type:

    $P_i(x,y) = \sum C_{\alpha,\beta}\, x^\alpha y^\beta$

where $d(\alpha) + d(\beta) \leq \deg(e_i)$ and $d(\alpha) \geq 1$, $d(\beta) \geq 1$ and some constants $C_{\alpha,\beta}$.
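Formula (10) can be checked by hand in the smallest non-abelian case. The sketch below is an illustration under stated assumptions: it models the Heisenberg group by upper unitriangular $3 \times 3$ matrices, takes $e_1 = E_{12}$, $e_2 = E_{23}$, $e_3 = E_{13}$ as adapted basis (degrees 1, 1, 2), and all function names are hypothetical. In these exponential coordinates $P_1 = P_2 = 0$ and $P_3(x,y) = \frac{1}{2}(x_1 y_2 - x_2 y_1)$, whose monomials satisfy $d(\alpha) + d(\beta) = 2 = \deg(e_3)$.

```python
from fractions import Fraction as F

def exp_map(x):
    """exp(x1*E12 + x2*E23 + x3*E13), encoded as (a, b, c) for the matrix
    [[1, a, c], [0, 1, b], [0, 0, 1]]; the series stops after the square term."""
    x1, x2, x3 = x
    return (x1, x2, x3 + x1 * x2 / 2)

def log_map(m):
    """Inverse of exp_map on unitriangular matrices encoded as (a, b, c)."""
    a, b, c = m
    return (a, b, c - a * b / 2)

def mat_mul(m, n):
    """Matrix product of two unitriangular matrices in the (a, b, c) encoding."""
    a, b, c = m
    a2, b2, c2 = n
    return (a + a2, b + b2, c + c2 + a * b2)

def product(x, y):
    """Group product expressed in exponential coordinates of the first kind."""
    return log_map(mat_mul(exp_map(x), exp_map(y)))

def formula_10(x, y):
    """(xy)_i = x_i + y_i + P_i(x, y) with P_3 = (x1*y2 - x2*y1) / 2."""
    return (x[0] + y[0], x[1] + y[1],
            x[2] + y[2] + (x[0] * y[1] - x[1] * y[0]) / 2)

x = (F(3), F(-1), F(5))
y = (F(2), F(7), F(-4))
# Both computations should give the same coordinates.
print(product(x, y), formula_10(x, y))
```

Exact rational arithmetic is used so that the comparison is an equality and not a floating-point approximation.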

2.4.2. Associated graded algebra, dilations. Let $(m_p)_{p \geq 1}$ be a collection of supplementary
subspaces on $\mathfrak{n}$ as in (6) for $V_i = C^i(\mathfrak{n})$. If $x \in \mathfrak{n}$, we write $x = \sum_{p \geq 1} \pi_p(x)$ where $\pi_p(x)$ is the
linear projection onto $m_p$. Let $(\delta_t)_{t>0}$ be the one-parameter group of dilations associated
to the $m_p$'s. The dilations $\delta_t$ do not a priori preserve the Lie bracket on $\mathfrak{n}$. This is the case
if and only if

(11)    $[m_p, m_q] \subset m_{p+q}$

for every $p$ and $q$. If (11) holds, we say that the $(m_p)_{p \geq 1}$ form a gradation of the Lie algebra
$\mathfrak{n}$, and that $\mathfrak{n}$ is a graded Lie algebra and $N$ is called a Carnot group.

If (11) does not hold, we can nevertheless consider a new Lie algebra structure on the real
vector space $\mathfrak{n}$ by setting $[x,y]_0 = \pi_{p+q}([x,y])$ if $x \in m_p$ and $y \in m_q$. This new structure $\mathfrak{n}_0$
is graded and the $(\delta_t)_{t>0}$ are automorphisms. We denote by $N_0$ the associated Lie group.
In fact the original Lie bracket $[x,y]$ on $\mathfrak{n}$ can be deformed continuously to $[x,y]_0$ by setting
$[x,y]_t = \delta_t([\delta_{1/t}x, \delta_{1/t}y])$ and letting $t \to 0$.

On the other hand, the graded Lie algebra associated with $\mathfrak{n}$ is by definition $\mathrm{gr}(\mathfrak{n}) =
\bigoplus_{p \geq 1} C^p(\mathfrak{n})/C^{p+1}(\mathfrak{n})$, endowed with the Lie bracket induced from the Lie bracket of $\mathfrak{n}$. The
quotient map $m_p \to C^p(\mathfrak{n})/C^{p+1}(\mathfrak{n})$ gives rise to a linear isomorphism between $\mathfrak{n}$ and $\mathrm{gr}(\mathfrak{n})$,
which is a Lie algebra isomorphism between $\mathfrak{n}_0$ and $\mathrm{gr}(\mathfrak{n})$. Hence graded Lie algebra
structures induced by a choice of supplementary subspaces $(m_p)_{p \geq 1}$ as in (6) are all isomorphic
to $\mathrm{gr}(\mathfrak{n})$.

2.4.3. Homogeneous quasi-norms on nilpotent Lie groups. Let $|\cdot|$ be a homogeneous
quasi-norm on $N$ with respect to some direct sum decomposition given by $(m_p)_{p \geq 1}$. Let $(e_i)_i$ be
an adapted basis of $\mathfrak{n}$.

Proposition 2.10. There is a constant $C > 0$ such that

(a) $|x_i| \leq C \cdot |x|^{\deg(e_i)}$ if $x = x_1 e_1 + \dots + x_n e_n$.

(b) $|x^{-1}| \leq C \cdot |x|$.

(c) $|xy| \leq C(|x| + |y| + 1)$.

It is straightforward to check (a) and (b). It can be a problem that the constant in
(c) need not be 1. In fact this is why we use the word quasi-norm instead of just norm:
we do not require the triangle inequality axiom to hold. However the following lemma of
Guivarc'h is often a good enough remedy to this situation. Let $\|\cdot\|_p$ be an arbitrary norm
on the vector space $m_p$.

Lemma 2.11. ([15] lemme II.1) Up to rescaling each $\|\cdot\|_p$ into a proportional norm $\lambda_p\|\cdot\|_p$
($\lambda_p > 0$) if necessary, the quasi-norm $|x| = \max_p \|\pi_p(x)\|_p^{1/p}$ satisfies $|xy| \leq |x| + |y| + c$ for
some constant $c \geq 0$ and for all $x, y \in N$. Besides $N$ is graded with respect to $(\delta_t)_t$ if and
only if $c = 0$.

The proof is based on the Campbell-Hausdorff formula (10). Lemma 2.11 yields property
(c) above and also is the key step to prove (8).

Example 2.12. Note that one important class of quasi-norms consists of those of the form
$|x| = d(e,x)$, where $d(x,y)$ is a Carnot-Carathéodory Finsler metric induced on a graded
nilpotent Lie group by some ordinary norm on the vector subspace $m_1$.

2.5. Nicely growing subsets. We define here nicely growing subsets. These are
essentially Følner subsets with some extra properties that behave well under intersection with a
connected subgroup. Our main result, Theorem 3.1 below, will hold for all such families of
subsets.

Recall that $\deg_{\mathfrak{n}}$ is the degree function from Paragraph 2.1.2 defined for all vector
subspaces of $\mathfrak{n} = \mathrm{Lie}(N)$.

Definition 2.13. We say that a family of measurable subsets $(A_t)_{t>0}$ of $N$ is nicely
growing if it satisfies the following properties:

(i) $(A_t)_{t>0}$ increases and exhausts $N$, i.e. $A_t \subset A_s$ if $t < s$ and $\bigcup_{t>0} A_t = N$.

(ii) For every compact subset $K \subset N$, there exists a positive function $\varepsilon_t > 0$ with $\varepsilon_t \to 0$
as $t \to +\infty$ such that $A_{t(1-\varepsilon_t)} \subset K A_t K \subset A_{t(1+\varepsilon_t)}$ for all $t$ large enough.

(iii) For any connected subgroup $M$ of $N$, there exists a constant $C(M) > 0$ such that

    $\lim_{t \to +\infty} \frac{\mathrm{vol}_M(M \cap A_t)}{t^{\deg_{\mathfrak{n}}(M)}} = C(M).$

(iv) There is a constant $a > 1$ such that $A_t A_t^{-1} \subset A_{at}$ for all $t > 1$.

Additionally, if $\Gamma$ is a finitely generated torsion-free nilpotent group, then a family $(A_t)_{t>0}$
of subsets of $\Gamma$ is said to be nicely growing if there is a nicely growing family $(\widetilde{A}_t)_{t>0}$ of
subsets of the Malcev closure of $\Gamma$ such that $\Gamma \cap \widetilde{A}_{t(1-\varepsilon_t)} \subset A_t \subset \Gamma \cap \widetilde{A}_{t(1+\varepsilon_t)}$ for all $t > 0$
and some positive function $\varepsilon_t > 0$ with $\varepsilon_t \to 0$ as $t \to +\infty$.

It will be convenient to broaden this definition a little bit by allowing different degree
functions than $\deg_{\mathfrak{n}}$ in axiom (iii). By a degree function on $N$, we mean any map $\deg$
from the set of all connected subgroups of $N$ to $\mathbb{N}$ which is non-decreasing in the sense that
$M_1 \subset M_2 \Rightarrow \deg M_1 \leq \deg M_2$. Then we can speak of a nicely growing family of subsets of
$N$ relative to the degree function $\deg$. By definition, such a family of subsets will satisfy
all four axioms except that axiom (iii) will now hold with $\deg(M)$ in place of $\deg_{\mathfrak{n}}(M)$.
The notion of nicely growing sets is stable under intersection with a connected subgroup,
say $M$, but then the degree function remains $\deg_{\mathfrak{n}}$.

Let us recall the statement of Proposition 2.1, which is our main goal here because it
provides us with many examples of nicely growing subsets.

Proposition 2.14. For any quasi-norm on $N$, the balls $(D_t)_{t>0}$, $D_t = \{|x| \leq t\}$, form a
nicely growing family of subsets of $N$.

Proof. Property (i) is obvious and property (iv) follows from properties (b) and (c) of Prop.
2.10. As for property (ii), it is a consequence of the following more general fact.

Lemma 2.15. Let $|\cdot|$ be a quasi-norm on $N$. Then for every $\varepsilon > 0$ there exist $\delta > 0$ and
$C > 0$ such that if $|y| > C$ and $|x| \leq \delta|y|$ we have

    $\big| |xy| - |y| \big| \leq \varepsilon|y|.$



Proof. Let $|\cdot|_0$ be the quasi-norm obtained from Guivarc'h's lemma (Lemma 2.11). Since
any two quasi-norms are equivalent, $|x|_0 = o(|y|_0)$ and $|x| = o(|y|)$ are equivalent conditions.
Using Prop. 2.10 (b) and Lemma 2.11 we see that if $|x|_0 = o(|y|_0)$ then

(12)    $\big| |xy|_0 - |y|_0 \big| = o(|y|_0).$

Now write

    $\big| |xy| - |y| \big| = \Big| |xy|_0 \cdot \big|\delta_{1/|xy|_0}(xy)\big| - |y|_0 \cdot \big|\delta_{1/|y|_0}(y)\big| \Big|.$

By (12), we get $\big| |xy| - |y| \big| = |y|_0 \cdot \big| |\delta_{1/|xy|_0}(xy)| - |\delta_{1/|y|_0}(y)| \big| + o(|y|)$. So it remains to show that
$|\delta_{1/|xy|_0}(xy)| / |\delta_{1/|y|_0}(y)|$ tends to 1 as $|y|$ tends to infinity and $|x| = o(|y|)$. By (12) and
compactness, we may even assume that $z = \delta_{1/|y|_0}(y)$ is fixed and, posing $t = |y|_0$, we have reduced
to showing that $\delta_{1/t}(x\delta_t(z)) \to z$ as $t \to +\infty$ and $|x| = o(t)$. To do this we use formula (10)
for the product in coordinates

    $\big(\delta_{1/t}(x\delta_t(z))\big)_i = \frac{x_i}{t^{d_i}} + z_i + \sum C_{\alpha,\beta}\, \frac{x^\alpha}{t^{d(\alpha)}}\, z^\beta\, t^{d(\alpha) + d(\beta) - d_i}$

where $d_i = \deg(e_i)$ and the adapted basis is chosen with $e_i \in m_{d_i}$,
with the additional constraints $d(\alpha) + d(\beta) \leq \deg(e_i)$ and $d(\alpha) \geq 1$, $d(\beta) \geq 1$. The condition
$|x| = o(t)$ means that $x_i = o(t^{\deg(e_i)})$ for any index $i$. As $t$ tends to infinity, we indeed obtain
the convergence of the above expression towards $z_i$. □

Observe that the analogous result holds when $xy$ is changed into $yx$ ($x \mapsto |x^{-1}|$ is another
equivalent quasi-norm).

We now turn to the proof of property (iii). This is where we will need the discussion on
filtrations from the previous sections. Let us denote by $\mathfrak{m} = \mathrm{Lie}(M)$ the Lie subalgebra of
$\mathfrak{n} = \mathrm{Lie}(N)$ corresponding to $M$. Let $d = \dim(M)$ and $(f_1, \dots, f_d)$ a basis for the vector
space $\mathfrak{m}$.

The orthogonal linear transformations (for some Euclidean norm $\|\cdot\|$ on $\mathfrak{n}$) act transitively
on $\mathrm{Gr}_d(\mathfrak{n})$, the Grassmannian variety of $d$-dimensional linear subspaces of $\mathfrak{n}$. Hence there is
some orthogonal map $o_t$ such that $o_t^{-1}\delta_{1/t}$ fixes $\mathfrak{m}$. Let $\mathrm{vol}_M$ be a Haar measure on $M$, which
we identify with Lebesgue measure on $\mathfrak{m}$. Let $a_t$ be the absolute value of the determinant
of the endomorphism induced on $\mathfrak{m}$ by $o_t^{-1}\delta_{1/t}$. We have

    $a_t = \frac{\|\delta_{1/t} f_1 \wedge \dots \wedge \delta_{1/t} f_d\|}{\|f_1 \wedge \dots \wedge f_d\|}.$

We can estimate the behavior of $a_t$ when $t \to +\infty$. As follows from (5) above,

    $\deg(f_1 \wedge \dots \wedge f_d) = \sum_{p \geq 1} \dim(\mathfrak{m} \cap C^p(\mathfrak{n})) = \deg_{\mathfrak{n}}(M).$

Hence by Prop. 2.4 there is a non-zero $\xi \in \Lambda^d\mathfrak{n}$ such that

(13)    $\lim_{t \to +\infty} t^{\deg_{\mathfrak{n}}(\mathfrak{m})}\, \delta_{1/t} f_1 \wedge \dots \wedge \delta_{1/t} f_d = \xi.$

Then we can define

(14)    $c_M := \lim_{t \to +\infty} \frac{1}{a_t\, t^{\deg_{\mathfrak{n}}(\mathfrak{m})}} = \frac{\|f_1 \wedge \dots \wedge f_d\|}{\|\xi\|} > 0.$

By (13) the subspaces $\delta_{1/t}\mathfrak{m}$ converge to a limit subspace $\mathfrak{m}_\infty$ in the Grassmannian variety.
So we could choose $o_t$ so that $o_t$ converges to some $o$ as $t \to +\infty$. Then $\mathfrak{m}_\infty = o\,\mathfrak{m}$. Also
observe that $\mathfrak{m}_\infty$ is invariant under the full one-parameter group of dilations $(\delta_t)_t$.

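Lemma 2.16. The subspace $\mathfrak{m}_\infty$ is a Lie subalgebra of $\mathfrak{n}_0$.

Proof. We need to show that if $x, y \in \mathfrak{m}_\infty$, then $x \cdot y \in \mathfrak{m}_\infty$, where $x \cdot y$ is the product in
$\mathfrak{n}_0$. Let $x_t$, $y_t$ in $\mathfrak{m}$ be such that $x = \lim_{t \to +\infty} \delta_{1/t}(x_t)$ and $y = \lim_{t \to +\infty} \delta_{1/t}(y_t)$. Then by (10)
we have, for each index $i$,

    $\big(\delta_{1/t}(x_t y_t) - \delta_{1/t}(x_t) \cdot \delta_{1/t}(y_t)\big)_i = \sum_{d(\alpha) + d(\beta) < \deg(e_i)} C_{\alpha,\beta}\, t^{-\deg(e_i)}\, x_t^\alpha y_t^\beta.$

By Prop. 2.10 (a) this expression is a $O(1/t)$. Hence $x \cdot y = \lim_{t \to +\infty} \delta_{1/t}(x_t y_t)$, i.e. $x \cdot y \in \mathfrak{m}_\infty$. □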

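Let $\mathrm{vol}_{M_\infty}$ be a Haar measure on $M_\infty$, again identified with Lebesgue measure on $\mathfrak{m}_\infty$.
Fixing the Haar measure on $N$, the choice of a Euclidean norm on $\mathfrak{n}$ specifies a normalization
for $\mathrm{vol}_M$ and $\mathrm{vol}_{M_\infty}$. Then we have:

Lemma 2.17. We have the following weak convergence of measures:

    $\lim_{t \to +\infty} \frac{1}{t^{\deg_{\mathfrak{n}}(\mathfrak{m})}}\, (\delta_{1/t})_* \mathrm{vol}_M = c_M \cdot \mathrm{vol}_{M_\infty}.$

Proof. By (14) we have

    $\lim_{t \to +\infty} \frac{1}{t^{\deg_{\mathfrak{n}}(\mathfrak{m})}}\, (\delta_{1/t})_* \mathrm{vol}_M = \lim_{t \to +\infty} \frac{1}{a_t\, t^{\deg_{\mathfrak{n}}(\mathfrak{m})}}\, (o_t)_* \mathrm{vol}_M = c_M \cdot \mathrm{vol}_{M_\infty}.$ □

By Lemma 2.17 applied to $D_1$, we get the desired convergence, namely

    $\lim_{t \to +\infty} \frac{\mathrm{vol}_M(M \cap D_t)}{t^{\deg_{\mathfrak{n}}(\mathfrak{m})}} = c_M \cdot \mathrm{vol}_{M_\infty}(D_1),$

after we check that $\mathrm{vol}_{M_\infty}(\partial D_1) = 0$. This is clear however since, using the invariance of
$M_\infty$ under $(\delta_t)_{t>0}$, the function $\mathrm{vol}_{M_\infty}(D_t) = |\mathrm{Jac}((\delta_t)|_{M_\infty})| \cdot \mathrm{vol}_{M_\infty}(D_1)$ is continuous as a
function of $t$. This ends the proof of Proposition 2.14. □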

2.6. Other examples of nicely growing subsets. Other types of balls give rise to nicely
growing subsets besides quasi-norm balls. We show here two more examples: balls obtained
by considering exponential coordinates of the second kind, and more importantly $\rho$-balls
for any "reasonable" left-invariant distance $\rho$ on $N$.

2.6.1. Balls in privileged coordinates. Let $(e_i)_{1 \leq i \leq n}$ be an adapted basis as in Paragraph
2.4.1 and $(x_i)_{1 \leq i \leq n}$ be the associated exponential coordinates of the first kind, i.e. $x = x_1 e_1 +
\dots + x_n e_n$. For $i = 1, \dots, n$, let $P_i$ be a polynomial map on $\mathfrak{n}$ of total (homogeneous) degree
$\leq d_i = \deg(e_i)$, with $P_i(0) = 0$. The map $P_i$ can be split in two parts, $P_i(x) = L_i(x) + M_i(x)$
where $L_i(x)$ is a linear form on $\mathfrak{n}$ depending only on those coordinates $x_j$ such that $d_j = d_i$
and where $M_i(x)$ is a polynomial depending only on those $x_j$'s with $d_j \leq d_i - 1$. Assume
further that the $L_i$'s, $i = 1, \dots, n$, are linearly independent. Then, following [6], we call
privileged coordinates any choice of coordinates on $\mathfrak{n}$ that is obtained from the $x_i$'s by a
coordinate change $\phi : x \mapsto x'$ of the form

(15)    $x'_i = P_i(x).$

For instance, writing $x$ in the associated exponential coordinates of the second kind, i.e.

    $x = \exp(x'_1 e_1) \cdot \ldots \cdot \exp(x'_n e_n)$

it is easy to check that (15) holds and we even have $L_i(x) = x_i$. We have:

Proposition 2.18. Let $|\cdot|$ be a homogeneous quasi-norm on $N$ and $\phi : x \mapsto x'$ a
privileged coordinate change. Then there exists a unique homogeneous quasi-norm $|\cdot|'$ such that
$|\phi(x)| = |x|' + o(|x|')$. In particular the balls $\{|\phi(x)| \leq t\}$ form a nicely growing family as $t$
tends to $+\infty$.

Proof. Let us write $\phi(x) = \psi(x) + \eta(x)$ where $(\psi(x))_i$ is the component of $P_i(x)$ that
is homogeneous of homogeneous degree $d_i$. Then clearly, the map $x \mapsto |\psi(x)|$ is a
homogeneous quasi-norm (property (i) of Def. 2.5 follows from the linear independence
of the $L_i$'s). Therefore, after composing by $\psi^{-1}$, we may assume that $\psi(x) = x$, i.e.
$\phi(x) = x + \eta(x)$ where each $(\eta(x))_i$ is a polynomial map of degree $\leq d_i - 1$. We wish to
show that $|\phi(x)| = |x| + o(|x|)$. Since $|\eta(x)| = o(|x|)$ the proof is a straightforward copy of
that of Lemma 2.15.

Given $\varepsilon > 0$, we thus get $D_{t(1-\varepsilon)} \subset \{|\phi(x)| \leq t\} \subset D_{t(1+\varepsilon)}$, for all large $t$, where $D_t =
\{|x| \leq t\}$. By Prop. 2.14 this clearly implies all of the four defining properties for nicely
growing subsets. □

2.6.2. Balls for left-invariant metrics. Following [Hj, we say that a distance function p on 
iV is a periodic metric if it is left-invariant under some co-compact subgroup of and is 
asymptotically geodesic, namely for every e > there is s > such that \/x, y & N, one 
can find points Xi = x, X2,--.,Xn = y in N such that p{x,y) > (1 — £)Y17=i Pi^i^^i+i) 
and < s for each i. This means that we require a kind of weak existence of 

geodesic axiom. Examples of such metrics are given by left-invariant Riemmanian or sub- 
Riemmanian metrics on N, and also by word metrics induced by a compact generating set 
of AT (see [8j). 

In [20], Pansu associates to any such $\rho$ a Carnot-Carathéodory metric $d_\rho$ on $N$ in the
following way. Let $(\delta_t)_t$ be a one-parameter group of (linear) dilations on $\mathfrak{n}$ and $\mathfrak{n} = \oplus_p m_p$
the eigenspace decomposition, with $\pi_1$ the projection to $m_1$. This yields a graded structure
$\mathfrak{n}_0$ on $\mathfrak{n}$ as defined in 2.4.2. Let $E_s$ be the closed convex hull of all $\pi_1(x)/\rho(e,x)$ with
$x \in N$ and $\rho(e,x) > s$, and let $E = \bigcap_{s>0} E_s$. Pansu shows that $E$ is a compact symmetric
subset of $m_1$ with non-empty interior. Hence it is the unit ball of some norm $\|\cdot\|_\rho$ on $m_1$.
In turn, this norm defines a Carnot-Carathéodory Finsler distance $d_\rho$ on $N_0$ by setting
$d_\rho(x,y) = \inf\{L(\gamma), \ \gamma \text{ horizontal path from } x \text{ to } y\}$, where horizontal means almost
everywhere tangent to an $N_0$-left translate of $m_1$ and $L(\gamma)$ is the length of $\gamma$ measured according
to $\|\cdot\|_\rho$. Pansu's main result reads (see [20], or [8] for a proof):

Theorem 2.19. ([20]) For any periodic metric $\rho$ on $N$,

(16) $\lim_{x \to \infty} \frac{\rho(e,x)}{d_\rho(e,x)} = 1.$



Corollary 2.20. The balls $B_\rho(t) = \{x \in N, \ \rho(e,x) \leq t\}$ form a family of nicely growing
subsets of $N$.

Proof. Clearly $x \mapsto d_\rho(e, x)$ is a homogeneous quasi-norm, so the Corollary follows from

Prop, [mi □ 

When proving Theorem 1.1 we will apply this result to the case when $\rho(x, y) = d(\gamma_x, \gamma_y)$,
where $d$ is any word metric on a lattice $\Gamma$ in $N$, and $x \in \gamma_x F$ for some fixed compact
fundamental domain $F$ of $\Gamma$ in $N$.

3. Equidistribution

In this section, we prove Theorem 1.1 via the following version of it:

Theorem 3.1. (Dense subgroups are equidistributed) Let $G$ be a closed subgroup of a simply
connected nilpotent Lie group and $\Gamma$ a finitely generated torsion-free nilpotent group. Let
$\phi : \Gamma \to G$ be a homomorphism with dense image. Suppose that $(A_t)_{t>0}$ is a nicely growing
family of subsets of $\Gamma$ (see Definition 2.13). Then there is a positive constant $C_1 > 0$
depending on $(A_t)_{t>0}$ and on the choice of a Haar measure $vol_G$ on $G$, such that for any
bounded Borel subset $B \subset G$ with negligible boundary we have

(17) $\lim_{t \to +\infty} \frac{\#\{\gamma \in A_t, \ \phi(\gamma) \in B\}}{t^{d(\Gamma) - d(G)}} = C_1 \cdot vol_G(B)$

where $d(\Gamma)$ and $d(G)$ are the integers defined in 2.3.
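
As a sanity check in the simplest abelian situation, the sketch below counts lattice points numerically. It assumes $\Gamma = \mathbb{Z}^2$, $G = \mathbb{R}$, $\phi(a,b) = a + b\sqrt{2}$, and takes $A_t$ to be the $\ell^1$-ball of radius $t$ (the word ball for the standard generators), so that $d(\Gamma) - d(G) = 1$. These choices and the helper name `count_in_interval` are illustrative assumptions, not taken from the paper.

```python
import math

SQRT2 = math.sqrt(2.0)

def count_in_interval(n, lo, hi):
    """Count (a, b) in Z^2 with |a| + |b| <= n and lo <= a + b*sqrt(2) < hi."""
    total = 0
    for b in range(-n, n + 1):
        m = n - abs(b)                          # |a| <= m keeps (a, b) in the word ball
        x = b * SQRT2
        a_min = max(math.ceil(lo - x), -m)      # smallest a with a + x >= lo
        a_max = min(math.ceil(hi - x) - 1, m)   # largest a with a + x < hi
        if a_max >= a_min:
            total += a_max - a_min + 1
    return total

n = 2000
c1 = count_in_interval(n, 0.0, 1.0)   # B = [0, 1)
c2 = count_in_interval(n, 0.0, 2.0)   # B = [0, 2)
print(c1 / n, c2 / c1)
```

In this toy case the normalized count `c1 / n` should be close to $2/(1+\sqrt{2})$ and the ratio `c2 / c1` close to 2, the ratio of the lengths of the two intervals.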

Recall that by Malcev's theory (see [21] Theorem 2.18), every finitely generated torsion-free
nilpotent group embeds in a simply connected nilpotent Lie group, its Malcev closure.
Let $N$ be the Malcev closure of $\Gamma$. Let $(D_t)_{t>0}$ be the family of nicely growing subsets of
$N$ such that $\Gamma \cap D_{t(1-\varepsilon_t)} \subset A_t \subset \Gamma \cap D_{t(1+\varepsilon_t)}$. First observe that it is enough to prove the
theorem for sets $A_t$ of the form $A_t = \Gamma \cap D_t$.

Before starting the proof of Theorem 3.1 let us briefly explain how one can also reduce
to the case when $G$ is connected. Recall that since $G$ is closed in a simply connected
nilpotent Lie group, it is co-compact in the simply connected subgroup $\widetilde{G}$ (its Zariski-closure)
defined in 2.3. By Malcev's rigidity (see [21] Theorem 2.11) $\phi$ extends to an
epimorphism $\phi : N \to \widetilde{G}$, which gives rise to an isomorphism $N/N_0 \cong \widetilde{G}/G^\circ$, where
$N_0 = \overline{\Gamma^\circ}$ is normal in $N$ and contains $M = \ker\phi$, and $\Gamma^\circ = \Gamma \cap \phi^{-1}(G^\circ)$. For every
bounded Borel subset $B$ of $G$, the set $\{\gamma \in \Gamma, \ \gamma \in D_t, \ \phi(\gamma) \in B\}$ can be split into a finite
number of translates of $\{\gamma \in \Gamma^\circ, \ \gamma \in \gamma_i D_t, \ \phi(\gamma) \in B_i\}$ where $\phi(\gamma_i^{-1})B_i \subset B$, $\gamma_i \in \Gamma$ and
$B_i \subset G^\circ$. Since $G^\circ$ is simply connected and $\Gamma^\circ$ is dense in it, we may just as well work with
these groups. However $D_t \cap N_0$ is a nicely growing family of subsets of $N_0$ only relative
to the degree function $\deg_N$ and not relative to $\deg_{N_0}$ (see the remarks below Definition
2.13). Nevertheless, we show below that if $(D_t)_t$ is a nicely growing family of subsets of $N$
with respect to an arbitrary degree function, then under the hypothesis and notation of
Theorem 3.1

(18) $\lim_{t \to +\infty} \frac{\#\{\gamma \in \Gamma, \ \gamma \in D_t, \ \phi(\gamma) \in B\}}{vol_M(D_t \cap M)} = vol_G(B)$

We will thus assume below that G is connected and simply connected. 

3.1. Unique ergodicity and counting. By Malcev's rigidity ([21] Theorem 2.11), $\phi$
extends to an epimorphism $\phi : N \to G$. Let $M = \ker\phi$. We want to find the asymptotics
of the number of points of $\Gamma \cap D_t$ which lie in $\phi^{-1}(B)$, the inverse image of the bounded
Borel subset $B$ by the map $\phi$. Hence we are dealing with a counting problem, which we
will treat via ergodic theory. The use of ergodic theory to solve counting problems is now
standard (see for instance [12], [11] and [4] for a survey of these techniques) and what we
are going to present here is yet another illustration of these ideas.

It is a fairly general principle in ergodic theory that the ergodic properties of the action
of a closed subgroup $H_1$ of a group $H$ on the homogeneous space $H/H_2$ can be deduced
from the ergodic properties of the action of the closed subgroup $H_2$ on $H/H_1$ and vice-versa.
Here we will deduce the equidistribution of $\Gamma$ in $G \cong N/M$ from the equidistribution
of an $M$-orbit on the nilmanifold $N/\Gamma$. In order to do this, we first recall the following
well-known theorem (see [22] Theorems 3.6 and 3.8, and also ^ Lemma 5.1):

Theorem 3.2. (Unique ergodicity criterion for nilflows) Let $\Gamma$ be a co-compact lattice in a
simply connected nilpotent Lie group $N$ and let $M$ be a closed subgroup of $N$. The following
are equivalent:

(i) The subset $M\Gamma$ is dense in $N$.

(ii) $M$ acts ergodically on $N/\Gamma$.

(iii) The $M$-action on $N/\Gamma$ is uniquely ergodic.
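
The criterion can be illustrated numerically in the abelian toy case $N = \mathbb{R}^2$, $\Gamma = \mathbb{Z}^2$, with $M$ the line through the origin of a given slope; $M\Gamma$ is dense exactly when the slope is irrational. The sketch below is an illustration under these assumptions only (the function names and the test function are hypothetical): it compares the average of a continuous function along an $M$-orbit on the torus with its space average, which is 0 here.

```python
import math

def orbit_average(slope, psi, T=500.0, step=0.01):
    """Riemann-sum average of psi along t -> (t, slope*t) mod Z^2 for 0 <= t < T."""
    steps = round(T / step)
    total = 0.0
    for k in range(steps):
        t = k * step
        total += psi(t % 1.0, (slope * t) % 1.0)
    return total / steps

def psi(x, y):
    # continuous function on the torus R^2/Z^2 whose integral over the torus is 0
    return math.cos(2.0 * math.pi * (x - y))

avg_irrational = orbit_average(math.sqrt(2.0), psi)  # M*Gamma dense in R^2
avg_rational = orbit_average(1.0, psi)               # M*Gamma is a closed proper subset
print(avg_irrational, avg_rational)
```

For the irrational slope the orbit average is close to the space average 0; for slope 1 the orbit stays on the diagonal circle, where `psi` is identically 1, so the average does not converge to the space average.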

Since $\Gamma$ is dense in $G \cong N/M$, Theorem 3.2 implies that the action of $M$ on $N/\Gamma$
is uniquely ergodic, i.e. that the normalized Haar measure $\nu$ on $N/\Gamma$ is the only
$M$-invariant probability measure on $N/\Gamma$. In order to translate the counting problem into an
equidistribution question, we introduce the following counting function for $x \in N$,

$F_t^B(x) = \#\{\gamma \in \Gamma, \ x\gamma \in D_t, \ \phi(x\gamma) \in B\}$

Note that $F_t^B$ is invariant under right multiplication by $\Gamma$, hence it really defines a measurable
function on the nilmanifold $N/\Gamma$. Note further that the quantity we are interested in is
precisely $F_t^B(e) = \#\{\gamma \in \Gamma, \ \gamma \in D_t, \ \phi(\gamma) \in B\}$, and our goal (i.e. (18)) is to prove the
convergence of $F_t^B(e)/V_t$, where $V_t = vol_M(M \cap D_t)$. The main step is to prove weak
convergence ((24) below) of the functions $F_t^B(x)/V_t$.

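3.2. Proof of Theorems 3.1 and 1.1. Since the Haar measure $\nu$ on $N/\Gamma$ is already
normalized, the choice of a Haar measure on $G$, denoted by $vol_G$, determines uniquely a
Haar measure on the kernel $M$, which we denote by $vol_M$. For every $g \in N$ we denote
by $\nu_t^g$ the image under $\pi : N \to N/\Gamma$ of the uniform probability measure supported on
$g^{-1}D_t \cap M$, i.e.

(19) $\nu_t^g = \pi_*\left( \frac{1_{g^{-1}D_t \cap M}}{vol_M(g^{-1}D_t \cap M)} \, vol_M \right)$

We let $\psi$ be a continuous function on $N/\Gamma$ and we consider the scalar product

$\langle F_t^B, \psi \rangle = \int_{N/\Gamma} F_t^B(x)\psi(x) \, d\nu(x) = \int_N 1_{D_t}(x) 1_B(\phi(x)) \psi(x) \, dx$

Decomposing the Haar measure on $N$ along the fibers of the projection $\phi : N \to N/M$, we
obtain,

(20) $\langle F_t^B, \psi \rangle = \int_G 1_B(g) \int_M 1_{D_t}(gm)\psi(gm) \, vol_M(dm) \, vol_G(dg)$

$\qquad = \int_B vol_M(g^{-1}D_t \cap M) \left( \int_{N/\Gamma} \psi(gz) \, \nu_t^g(dz) \right) vol_G(dg)$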




In order to go further, we need the following proposition, which is a consequence of
the unique ergodicity of the $M$-action on $N/\Gamma$.

Proposition 3.3. The following weak convergence of probability measures on $N/\Gamma$ holds
uniformly when $g$ varies in compact subsets of $N$.

$\lim_{t \to +\infty} \nu_t^g = \nu$

Proof. Let $\nu_\infty$ be a weak limit of $\nu_t^g$ as $t \to +\infty$ and $g$ converges to some element in $N$.
Since the $M$-action is uniquely ergodic on $N/\Gamma$ by Theorem 3.2, it is enough to show that
$\nu_\infty$ is invariant under $M$. Let $\mu_t^g$ be the probability measure on $N$ such that $\nu_t^g = \pi_*(\mu_t^g)$
as defined in (19). The map $\pi_* : P(N) \to P(N/\Gamma)$ between spaces of probability measures
is an $N$-equivariant contraction for the total variation norm, hence to show that $\nu_\infty$ is
invariant under $M$, it is enough to prove the following lemma.

Lemma 3.4. For any $h \in M$, the following convergence holds uniformly in $g$ as $g$ varies
in compact subsets of $N$

$\lim_{t \to +\infty} \|\delta_h * \mu_t^g - \mu_t^g\| = 0$

Proof. Since $h \in M$, the measures $\mu_t^g$ and $\delta_h * \mu_t^g$ are supported on $M$ and are absolutely
continuous with respect to the Haar measure $vol_M$. Hence the total variation norm is
simply the $L^1$ norm. So

(21) $\|\delta_h * \mu_t^g - \mu_t^g\| = \frac{vol_M(M \cap (h^{-1}g^{-1}D_t \, \Delta \, g^{-1}D_t))}{vol_M(M \cap g^{-1}D_t)}$

where $\Delta$ is the symmetric difference operator. Note that combining both properties (ii)
and (iii) of Definition 2.13, the following convergence holds uniformly when $g$ varies in
compact subsets of $N$.

(22) $\lim_{t \to +\infty} \frac{vol_M(g^{-1}D_t \cap M)}{vol_M(D_t \cap M)} = 1$

Similarly, by property (ii), if $g$ lies in a compact set, there will be some positive function
$\varepsilon_t > 0$ with $\varepsilon_t \to 0$ such that we can write $h^{-1}g^{-1}D_t \, \Delta \, g^{-1}D_t \subset D_{t+t\varepsilon_t} \setminus D_{t-t\varepsilon_t}$. But by
property (iii) we can conclude that

(23) $\lim_{t \to +\infty} \frac{vol_M((D_{t+t\varepsilon_t} \setminus D_{t-t\varepsilon_t}) \cap M)}{vol_M(M \cap D_t)} = 0$

Combining (21) with (22) and (23) we are done. □

□ 
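
The mechanism behind Lemma 3.4 (a fixed translation moves a large set by an amount that is negligible relative to its volume) can be seen in a toy computation. The sketch below assumes $M = \mathbb{R}^2$ and takes $D_t$ to be the Euclidean disc of radius $t$; neither assumption comes from the paper, and `tv_shifted_disc` is a hypothetical helper. It evaluates the volume of the symmetric difference of the disc with its translate by a vector of length $d$, divided by the volume of the disc, using the classical formula for the area of the intersection of two equal discs.

```python
import math

def tv_shifted_disc(t, d):
    """vol(D_t symmetric-difference (D_t + h)) / vol(D_t), D_t a disc of radius t in R^2, |h| = d."""
    if d >= 2.0 * t:
        return 2.0  # the two discs are disjoint
    # area of the lens-shaped intersection of two discs of radius t at distance d
    lens = 2.0 * t * t * math.acos(d / (2.0 * t)) - 0.5 * d * math.sqrt(4.0 * t * t - d * d)
    area = math.pi * t * t
    return 2.0 * (area - lens) / area

for t in (10.0, 100.0, 1000.0):
    print(t, tv_shifted_disc(t, 1.0))
```

In this example the ratio decays like $4d/(\pi t)$, so it tends to 0 as $t \to +\infty$ for any fixed translation length $d$.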

Let us resume the proof of Theorem 3.1. Since $B$ is bounded and $g \in B$ in the integral
(20), when $t$ tends to $+\infty$ we obtain the weak convergence:

(24) $\lim_{t \to +\infty} \frac{\langle F_t^B, \psi \rangle}{vol_M(D_t \cap M)} = vol_G(B) \cdot \int_{N/\Gamma} \psi(z) \, d\nu(z)$
In order to get the desired asymptotics for $F_t^B(e)$, we need to compare it to $\langle F_t^B, \psi \rangle$ where $\psi$
is chosen to better and better approximate the Dirac distribution $\delta_e$. For every sufficiently
small neighborhood of the identity $U$ in $N$, which is homeomorphic to $\pi(U)$ via the covering
$\pi$, we may consider a bump function $\psi$ supported on $\pi(U)$ (i.e. a non-negative continuous
function with total integral equal to 1). Then $F_t^B(e) \leq F_{t+o(t)}^{\phi(U)B}(g)$ for any $g \in U$ by property
(ii) of nicely growing subsets. Note also that $vol_G(\phi(U)B)$ gets closer and closer to $vol_G(B)$
as $U$ tends to the identity. It follows that $F_t^B(e) \leq \langle F_{t+o(t)}^{\phi(U)B}, \psi \rangle$ and from property (iii) of
nicely growing subsets, applying (24), we obtain that

(25) $\limsup_{t \to +\infty} \frac{F_t^B(e)}{vol_M(D_t \cap M)} \leq vol_G(B)$

The lower bound is only slightly more delicate. We have $F_t^B(e) \geq F_t^{B^\circ}(e)$ where $B^\circ$ is
the interior of $B$. Since $B$ has negligible boundary, for any sufficiently small neighborhood
$V$ of the identity in $G$, there exists an open subset $B_V$ of $B^\circ$ such that $V B_V \subset B^\circ$ and
$vol_G(B_V) \to vol_G(B)$ as $V$ narrows to the identity.

Now let $U$ be a neighborhood of the identity in $N$ so small that $\phi(U^{-1}) \subset V$. Then
$F_t^{B^\circ}(e) \geq F_{t-o(t)}^{B_V}(g)$ for any $g \in U$. It follows that $F_t^B(e) \geq \langle F_{t-o(t)}^{B_V}, \psi \rangle$ for any
bump function $\psi$ supported on $\pi(U)$. Again from property (iii) of nicely growing subsets,
applying (24), we obtain

$\liminf_{t \to +\infty} \frac{F_t^B(e)}{vol_M(D_t \cap M)} \geq vol_G(B).$

We have proved

$\lim_{t \to +\infty} \frac{F_t^B(e)}{vol_M(D_t \cap M)} = vol_G(B)$

which, together with the volume estimate given by property (iii) of Definition 2.13, ends
the proof of Theorem 3.1.

Proof of Theorem 1.1. If $T$ is the torsion subgroup of $\Gamma$, the map $\phi$ factors through $T$
to a quotient map $\bar\phi : \Gamma/T \to G$. Since $T$ is finite, there is an integer $c > 0$ such that
$S^{n-c} \cdot T \subset S^n$. It follows that $|S^n \cap \phi^{-1}(B)| \sim |T| \cdot |\bar{S}^n \cap \bar\phi^{-1}(B)|$ where $\bar{S}$ is the image
of $S$ in $\Gamma/T$. Moreover the sets $\bar{S}^n$ form a family of nicely growing subsets of $\Gamma/T$ according to
Corollary 2.20. Therefore Theorem 1.1 follows from Theorem 3.1. □

Proof of Corollary 1.2. Apply Theorem 1.1 to $B = U$ and $B = V$ with $\phi = id$ and take
the ratio. □

We finally state one last result which generalizes Theorem 3.1 in an obvious way (we
assume here $G$ simply connected) and whose proof we only sketch because it is entirely
analogous to the proof of Theorem 3.1 we just gave and presents no additional difficulties.
Recall from Lemma 2.16 that $M_\infty = \lim_{t \to +\infty} \delta_{1/t} M$ is a connected subgroup of
$N_0$. Let $C_M > 0$ be the constant from Lemma 2.17. We have:
Theorem 3.5. For any bounded Borel subset $B$ of $G$ with $vol_G(\partial B) = 0$ we have the
following weak convergence of measures on $N$,

$\lim_{t \to +\infty} \frac{1}{t^{d(\Gamma) - d(G)}} \sum_{\gamma \in \Gamma, \ \phi(\gamma) \in B} \Delta_{\delta_{1/t}(\gamma)} = C_M \cdot vol_G(B) \cdot vol_{M_\infty}$

where $\Delta_x$ is the Dirac mass at $x$.

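Proof sketch. Let $f \geq 0$ be a compactly supported function on $N$, let $h = 1_B$ and set
$F_t(x) = \sum_{\gamma \in \Gamma} f(\delta_{1/t}(x\gamma)) h(\phi(x\gamma))$ for $x \in N$. Let $V_t = \int_M f(\delta_{1/t}(m)) \, dvol_M(m)$. By Lemma
2.17, $V_t / t^{d(\Gamma) - d(G)} \to C_M \cdot \int f \, dvol_{M_\infty}$. When this is positive, the unique ergodicity of $M$
on $N/\Gamma$ yields as above the weak convergence $\langle F_t, \psi \rangle / V_t \to vol_G(B) \cdot \int_{N/\Gamma} \psi \, d\nu$. The
point-wise convergence is derived in a similar way. □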

3.3. Local limit theorem for random walk averages. In this last paragraph we prove
Corollary 1.3. The proof makes use of the results obtained by Varopoulos and Alexopoulos
about convolution powers of measures on nilpotent groups. We first explain some terminology
and refer the reader to [1] for the details.

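Let $\mu = \sum_{s \in S} \mu(s)\delta_s$ be a symmetric probability measure on $G$, whose finite support $S$
generates a dense subgroup $\Gamma$ of $G$. Let $\phi : N \to G \cong N/M$ be the map given by Malcev's
rigidity as before. We now define the sub-Laplacian associated to $\mu$ on $N$, by setting

$L_\mu = \sum_{i=n_1+1}^{n_1+n_2} b_i X_i + \frac{1}{2} \sum_{1 \leq i,j \leq n_1} a_{ij} X_i X_j$

where the $X_i$'s are $N$-left invariant vector fields on $N$ that form an adapted basis for the
Lie algebra $\mathfrak{n}$ (i.e. $\mathfrak{n} = \oplus_{p \geq 1} m_p$ and $C^q(\mathfrak{n}) = \oplus_{p \geq q} m_p$ where $C^q(\mathfrak{n}) = span\{X_i, \ d_i \geq q\}$ and
$n_p = \dim m_p$). We let $(\delta_t)_{t>0}$ be the corresponding one-parameter group of dilations. The
coefficients are defined by

$a_{ij} = \int x_i x_j \, d\mu(x)$ if $1 \leq i, j \leq n_1$,

$b_i = \int x_i \, d\mu(x)$ if $n_1 < i \leq n_1 + n_2$.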

By definition, the heat kernel $(p_t)_{t>0}$ associated to $\mu$ on $G$ is the solution to the heat
equation for $L_\mu$, that is

(26) $\frac{\partial p_t}{\partial t} = L_\mu p_t$

Since $\Gamma$ is dense in $G$, the matrix $(a_{ij})_{1 \leq i,j \leq n_1}$ is non-degenerate and equation (26) has a
unique solution $(p_t)_{t>0}$ such that $p_t$ is smooth and positive everywhere with total integral
over $N$ equal to 1. We will also need to consider the "heat kernel at infinity" $(p_t^0)_{t>0}$ defined
by the same equation, except that the $X_i$'s are replaced by the corresponding left-invariant
vector fields for the graded group structure $N_0$ associated to the group of dilations $(\delta_t)_{t>0}$
(see 2.4.2). The $(p_t^0)_{t>0}$ enjoy the scaling property

(27) $t^{d(\Gamma)/2} \, p_t^0(\delta_{\sqrt{t}}(x)) = p_1^0(x).$
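
In the simplest abelian case the scaling property of the heat kernel can be checked directly. The sketch below assumes $N = \mathbb{R}$ with operator $\frac{a}{2}\frac{d^2}{dx^2}$, for which the heat kernel is the Gaussian density of variance $at$, the dilation is $x \mapsto \sqrt{t}\,x$ and the homogeneous dimension is 1; the function names are illustrative and not part of the paper.

```python
import math

def heat_kernel(t, x, a=1.0):
    """Mass-one solution p_t(x) of dp/dt = (a/2) d^2p/dx^2 on R (Gaussian of variance a*t)."""
    return math.exp(-x * x / (2.0 * a * t)) / math.sqrt(2.0 * math.pi * a * t)

def rescaled(t, x, a=1.0):
    # t^{1/2} * p_t(sqrt(t) * x): homogeneous dimension 1, dilation x -> sqrt(t) * x
    return math.sqrt(t) * heat_kernel(t, math.sqrt(t) * x, a)

for t in (0.5, 4.0, 100.0):
    print(t, rescaled(t, 0.7), heat_kernel(1.0, 0.7))
```

For every $t$ the rescaled value agrees with $p_1(x)$ up to rounding, which is the Euclidean instance of parabolic scaling.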
Let $|\cdot|$ be a quasi-norm on $N$. We have:

Theorem 3.6. Corollary 1.19) For every a, < a < |, there zs Cq, > such that 
Wn G N, V7 G r, 

(28) n-^ l^-(^) < ^exp 

Theorem 3.7. ^[IJ, Theorem 6.6) 3c > 0, G N, Vx 

(29) \Vn{x) -pI{x)\ < 

Theorem 3.8. Corollary 6.5) Let di = {Xi). 3c > 0, VA; G N, Vzi, ...,ik > 0, Vt > 0, 

Proof of CorollaryTM We have /i*"(S) = 'Zje^^-^B) /^*"(7)- Fix a e (0, |) and let e > 
with 6 ■ (d(r) - d{G)) < a. Then combining ([28]) and ^ we get E|^|>„^+. /^*"(7) < 
^'^'^^'^ ' ^^pil^) whlch teuds to as tends to +00. The same holds for Pn(7) 

or Pnil) in place of /i*"(7). So when computing ij,*^'{B), we may as well restrict to 
counting points 7 G -B with I7I < 722+^. According to Theorem 13.11 4^{'y G F, I7I < n, 7 G 
(j)~^{B)} = 0{n'^^^^~^^'^^), hence combining this estimate with fl28l) we get, 



n— /i*"(5)= n— p„(7) + 0(1) = n— p°(7) + o(l) 

|7|<n2+=',7e</.-i(B) 7e0-l(i?) 

By the scaling property (I27|) n~^p°(7) = n~ ^ '2 ^ *Pi(^^(7))- Then by Theorem 



n'^r{B) = ^^^ J2 pUS^il)) + o{l) = CM-volG{B)- f p'.dvolM^ + o{l) 
n 2 ^g^-i(B) " "'A^oo 



□ 